﻿ The Use of Referential Constraints in Structuring Discourse Violeta Sereţan♦, Dan Cristea♣ ♦ University of Geneva, Language Technology Laboratory 2, rue de Candolle, CH-1211 Genève 4, Switzerland Violeta Seretan@lettres unige ch ♣ "Al I Cuza" University of Iaşi, Dept of Computer Science 16, Berthelot St , 6600 – Iaşi, Romania dcristea@infoiasi ro Abstract The quality of discourse structure annotations is negatively influenced by the numerous difficulties that occur in the analysis process In contrast, referential annotation resources are considerably more reliable, given the high precision of the existent anaphora resolution systems We present an approach based on the Veins Theory (Cristea, Ide, Romary, 1998), in which successful reference annotations of texts are exploited in order to improve arbitrary structural analyses; in this way, the large amount of corpora annotated at reference level can be used for the acquisition of discourse structure annotation resources We concentrate, instead, on the way the use of 1 Introduction referring expressions restricts the discourse interpretation, Discourse analysis is an important but difficult task, and we intend to use them as disambiguation clues during requiring a lot of effort in order to capture the relations structure derivation, or as structural constraints for between text constituents and thus to decode the author's correcting arbitrary (possibly automatically generated) communicative intentions analyses The popular theory of discourse structure, Rhetorical A reliable reference annotation can be used to impose Structure Theory (RST; Mann and Thompson, 1988) is constraints on the partial or complete discourse structure proved to account for the manner in which the text is built for a text, by means of the prescriptions on the organized in correspondence with the speaker's intention relationship between the discourse structure and Recent research on the relationship between the structure references stated in Veins Theory (VT; Cristea, Ide, of the text and intentions showed (Moser and Moore, Romary, 1998) According to these prescriptions, the 1996; Marcu, 1999) the similarity between the intention-reference chains from text are associated to sets of based discourse structure (Grosz and Sidner, 1986) and structurally related units, the "veins" of discourse The the RST tree-like structure built for a text references from a given unit are mostly to preceding units Despite its popularity and usability, RST lacks that are contained in the unit's vein important prescriptions on criteria for the hierarchical In a well-formed structure, anaphora are expected to aggregation of the pieces of text, this making structural be resolved along the veins If this is not the case, it is analysis an ambiguous task and leading to inappropriate likely that errors have occurred in the process of text interpretations interpretation, which disrupted the structural connection, Many elements provide hints on the discourse i e the path given by the vein, between the pairs of structure, e g delimiters, cue-phrases, time etc , that are concerned units extensively used for the automatic computation of the The goal of our approach is to systematically detect coherence relations (Marcu 1997, Kurohashi and Nagao, and correct the structural errors which are likely to occur 1997) during the structural annotation, and which can be We will focus on those elements that better indicate signalled by the resolution of anaphors outside their the structure of a text: the co-references in it A reference domains of accessibility as given by the structure from an anaphor to its antecedent indicates a structural The next section briefly revises the Veins Theory main relation between the textual units involved The referential concepts and ideas on which our approach is based It is chains in discourse (the repeated references to the same followed by a section that describes in which way the VT discourse entity) contain important information about the prescriptions are used in better structuring discourse, and text organization; therefore, they should also be presents a method for the local and global correction of a considered when structuring discourse structure Afterwards, we present the results of an While largely agreed that there is a straight relation empirical study on a corpus of texts, the conclusions it between the references in a text and its structure (Fox, allows us to draw and, finally, the comparison with the 1987; Vonk et al , 1992), up to now most of the attention related work was concentrated in only one direction, i e on the way in which the process of anaphora resolution is influenced by 2 Global Discourse Cohesion in VT the hierarchical organization of the text For example, in Veins Theory extends and formalizes the relation (Fox, 1987) it is indicated that the treatment of anaphora between discourse structure and reference proposed by should consider the hierarchal structure of texts, and in Fox (1987) Its central notion is the "vein", defined over (Cristea et al , 2000) it is shown that the anaphora discourse structure trees built according to the RST resolution can benefit from following a discourse requirements organization 2 1 The Vein Concept accounts for the cases when the discourse structure is not VT's fundamental assumption is that references in a left-polarized, that is, when a satellite can precede a text occur mostly between units that are in structural nucleus In such a case, the antecedents from the satellite relation with one another, even if they are distant in the should also be accessible for further referring, as long as text the anaphor's unit is not descendent of a right satellite The sets of structurally related units form the main node The interposition of a nuclear node blocks the threads of discourse, called veins, and are defined on the accessibility between the left satellite and the right basis of the nuclearity of the constituents in the discourse subsequent satellites structure built for the text They express the idea that, in We illustrate the computation of the vein on an order to understand a unit in the context of the whole text, example taken from the corpus we examined (see Figure only part of the discourse units, including the one under 1) Unit 7, for instance, is contained in the vein formed by examination, are required Considering that these units units 1, 2 (that are nuclei in the most important preceding form a chain, all references from the examined unit should nodes), 6 (the sister that is a left satellite), and 7 (the unit be resolved along the sub-chain of preceding units In a itself) The vein of unit 8 comprises units 1, 2, 7 (as left-polarized tree, the references are mainly to nuclei before), but not unit 6, as discussed before (nuclear unit 7 rather than to satellites In addition, the vein expression interpose between units 6 and 8) Figure 1: The representation of the RST analysis for a text 1 unit An example is the reference to "Mr Wright" from 2 2 Direct and Indirect Vein References unit 3 to unit 2: the vein of unit 3 is "1 2 3 4" The reference annotation of texts widely adopts the In the case of indirect reference, the unit of the closest convention of textual proximity: if an entity is referred antecedent of an anaphor is not on the vein of anaphor’s more than once in a text, the co-referential links are unit However, a more distant antecedent exists in a unit marked from each anaphor to the closest antecedent in the belonging to the anaphor’s unit vein It is the case of the text that points to the same entity In the text in Figure 1, reference to "Mr Wright" from unit 7: the closest for example, the entity "Mr Wright" is referred to 3 times, antecedent is in unit 3, which is not contained in the vein in units 2, 3 and 7, and the co-referential links are marked expression of unit 7 ("1 2 6 7"), but a previous antecedent from unit 3 in unit 2 and from unit 7 in unit 3 in the same reference chain is in unit 2, present in the This process doesn't follow a discourse structure vein Obviously, the pairs of units involved in an anaphoric In the first case, the target of the co-reference that was relation are structurally connected by some path along the indicated by the annotation corresponds indeed to the coherence relations; the closest the antecedent along this structurally signalled target In the latest case, the target path, the more entitled to be considered as the right target proposed, i e the linearly closest antecedent in the list, of the co-reference link One could find more intuitive to was not the closest one along the structural path mark, in our example, the link from unit 7 directly to unit For both direct and indirect reference types, the 2 anaphor is resolved on its corresponding vein, henceforth VT considers that the relation of co-reference induces this situation is referred to as vein resolution equivalence classes over the set of referential expression, and distinguishes between direct and indirect vein 2 3 Vein Resolution Exceptions references VT indicates that most of the references from a A direct reference situation is that in which the unit of coherent text will be resolved along the veins, directly or the closest antecedent belongs to the vein of the anaphor's indirectly, thus being easier and quicker to interpret 1 The figure is depicted using the RST-Tool (O'Donnell, 1997) The references for which no antecedent from the same - the way the constituents of a rhetorical relation are chain occurs on the vein are supposed to be of identified; pragmatical nature, in the sense that the anaphor can be - whether the speaker intends to assign, for some understood without any antecedent, using the general relations, a more important role to one component world knowledge, like if it were introduced for the first or to the other (which constituent is "nucleus" and time in discourse which one is "satellite"); Anyhow, few cases are reported, in (Cristea at al , - under what conditions two spans of text can be 2000) for instance, where out-of-vein resolution combined into a higher structure, using a rhetorical exceptions are not of a pragmatical nature It is unclear relation whether they are due to error in the structure annotation Like in VT, we consider the structure of discourse We tend to believe that such an exception occurs when being that of a binary tree, therefore we see a rhetorical the relation between the two parts of text given by the relation holding between two sibling descendants Also reference is not conveyed by the structure, and that, like in VT, we ignore the name of relations while keeping probably, a non-valid interpretation was produced in the at value their polarity given by the nucleus-satellite process of discourse analysis dichotomy As such, we abstract away from the relation This is, for example, the case of the structure in Figure name ambiguity as well as from the dispute on relations 1 The reference to the entity "Merrill Lynch Canada Inc " taxonomy, focusing instead on issues of text interpretation from unit 6 to unit 3 is an exception, since unit 3 is not on that are common to all structural theories We claim that the vein of unit 6, which is "1 2 6 7" Later we will see doing that way the approach gains in generality and that this exception is due indeed to structure provides a wider range of applicability If the intention is misconfiguration to acquire a higher specificity, a mapping to a given set of relations could, ultimately, be added 3 From Cohesion To Coherence Using VT Between the factors previously mentioned, the one that Our goal is to perform slight modifications on the we consider that mostly renders difficult the process of current structure configuration (which can be either partial analysis is the size of relation's constituents Human or complete), in the areas indicated by the exceptions in annotators generally agree on the segmentation of text in the vein resolution, in order to enable the referential elementary units and on the relative importance of the accessibility between the two parts of text involved and to constituents (Marcu, Amorrortu, and Romera, 1999), but, reconstruct the structural relation that exists between in what concerns the spans of text the relation entails, the them risk of mistakes is much higher The content of this section is the following: we will Leaving apart the segmentation of discourse, we first look at the causes that could lead, during the consider that, in one step of discourse analysis, two discourse analysis, to structures where a hierarchical possible types of choices affect the well-formedness of relation between two units is not allowed, in spite of the structure configuration, in the absence of criteria for existence of a co-reference from one to another compositionality: Corresponding to the nature of these causes, we will - the manner of assigning the nuclear roles for present several correcting operations which will be constituents, applied on the affected structure, that aim at its - the manner of associating sub-structures in a bigger reconfiguration so that it satisfies the constraints imposed structure, at a higher level in the hierarchy by the references Furthermore, we will show how applying these operations to the original structure 3 2 Discourse Structure Errors indicated by influences the vein-resolution for the involved references Exceptions: two Examples Finally, we show how the local and global corrections We provide two representative examples in which the are applied, looking at the references in the text, in order ambiguities of the above mentioned nature led to to reconfigure an arbitrary structural analysis so that the construction errors that were then signalled by the vein hierarchical relations obey to the constraints given by the resolution exceptions use of references The first example, related to the hierarchical ambiguity, is represented by the structure in Figure 1 in 3 1 Misleading Factors in Discourse Structure section 2 1 The annotator improperly associates two Analysis constituents in the hierarchy, adjoining the structure 6-8 in With respect to text structure, different theories exhibit the root of the already created structure 2-5, and not in the many commonalities For instance the “fix-point” of any node 3-5 situated below on the right frontier The vein of RST-like analysis is given, at least, by: the elementary unit 6 is then affected so that it doesn't contain the unit 3 units of text structure are non-overlapping spans of text, Consequently, the reference to "Merrill Lynch" from unit some textual units play a more important role then others, 6 to unit 3 appears like an exception the abstract structure of a text is a tree (Marcu, 2000) But The correct target for the adjunction of substructure 6- it is also well known that, more often than not, more than 8 to the substructure 2-5 is the node 3-5, as in Figure 2 just one analysis could be drawn, as one should face at Actually, the whole span (2-5) is not in the relation of least the inherent ambiguities of rhetorical structure and elaboration-aditional with the substructure 6-8, but only the scarcity of elaborated theoretical knowledge on the its subspan, 3-5, because the new topic ("no successor was way hierarchical aggregation of structure constituents in named to Mr Wright") elaborates only the topic in an RST tree is to be pursued Indeed, RST ambiguity subspan 3-5 ("Mr Wright resigned as president of Marrill arises in what concerns: Lynch"), and not the topic of the whole span ("Donald - the way the elementary textual spans are defined; Wright was named executive prime vice president at Burns Fry") This is consistent with the satisfaction of the initial structure, this criterion is not obeyed: the relation Compositionality Criterion (Marcu, 1997), which says that doesn't hold between units 2 and 7 a relation that holds between two spans holds also In the correct structure, the reference from unit 6 to between the most salient units of the constituent spans: the unit 3 becomes direct reference on the vein, since the relation also holds between units 3 and 7, while, in the correct vein expression for unit 6 is "1 2 3 4 6 7" Figure 2: The correct structure proposed for the original structure from Figure 1 The second example is related to the nuclearity The referential expression “the top job” in unit 4 refers ambiguity Let's suppose that one assigns a role of S back to the “chairman” position mentioned in unit 1 Still, (satellite) to a left constituent of a relation instead of N a reference from unit 4 in unit 1 is not allowed, since unit (nucleus), by choosing either a relation that is inadequate 1 is not on the vein of unit 4, whose expression is "2 3 4" (from a set of several possible relations), or one whose If unit 1 were nuclear, the reference would have been RST nuclear role assignation is uncertain2 possible When verifying the structure, we found that It could happen that the vein accessibility from an relation r1 is better interpreted as sequence, therefore a anaphor to an antecedent be blocked from further units, binuclear relation, since the intentions realized by its like in the text below extracted from our corpus: constituents are in a relation of satisfaction-precedence (1) At Chrysler, Mr York had been a dark-and not dominance, in terms of the intentional structure horse candidate to succeed departing chairman Lee (Grosz and Sidner, 1986) The correction modifies the A Iacocca vein expression of unit 4 to be "1 2 3 4", therefore (2) Chrysler ultimately chose former General allowing a reference from 4 to 1 Motors Corp executive Robert Eaton There are multiple situations in which the particular (3) The succession struggle left relations choices at each step of structure derivation affect the vein somewhat strained between Mr York and Chrysler accessibility for certain units of text We studied the effect president Robert Lutz, of all types of choices over the vein expression of text's an, (4) who had been another contender for the top units and over the vein-resolution of anaphora (Sereţ job 2000) The original structure associated by the annotators to the text was that in Figure 3, where the nuclear 3 3 Veins-Guided Structure Recovery constituents were underlined The relations r1, r2 and r3 were originally identified as, respectively, background, In the process of correction of an arbitrary discourse elaboration-object-attribute, and consequence: structure, we considered two main types of modifications: an operation related to the hierarchy of constituents in the tree-like text structure, and an operation related to the r3r3r3r3nuclearity of constituents They correspond to the two choices possible in one step of analysis r1r2r1r2r1r2r1r2The manner in which these operations will be applied on the discourse structure will be discussed later, in the 1234123412341234subsection 3 4 Figure 3: The RST structure of the text in Example 1, 3 3 1 Basic Recovery Operations represented as a binary tree Structural operations The first type of structural modification is relatively complex and is inspired by the operations from Tree Adjoining Grammars (Joshi, 1987) All operations obey the sequentiality principle (Cristea 2and Webber, 1997), according to which at any time during In several cases, e g for the consequence relation, it is ambiguous as to which unit is nucleus and which one is satellite analysis the sequence of nodes on the terminal frontier of (Marcu, 1997) the tree corresponds to the sequence of discourse units in the original text It consists of two elementary operations - an indirect reference becomes direct reference5 on the substructure involved: The opposite transformations take place when i a cut operation, of extraction of a sub-tree from one removing units from the vein substructure's daughter; We already encountered an example of transformation ii a paste operation, of adjunction of the tree obtained in subsection 3 2, where an exception became direct vein at the previous step, as left or right auxiliary tree, reference after reconsidering a substructure’s place in the on the opposite (right, respectively left) internal hierarchy frontier3 of the other substructure's daughter Figure 4 below schematizes one type of structural 3 4 Applying the Operations operation, which consists of cutting a sub-tree from the right daughter and adjoining it as left auxiliary tree on the In case the proposed discourse structure is such that right frontier of the left daughter any given anaphor is resolved on its corresponding vein, it means that the relation between the pairs of the units involved (stated by the reference itself) is structurally well-formed, since the vein is a set of related units in the hierarchy of discourse Otherwise, the references are exceptions from the point of view of the vein resolution and could indicate misconfigurations of the structure We ****then try to apply the operations described on the cases indicated by these exceptions 1σ2σ3σ4σ1σ3σ4σ1σ2σ3σ4σ1σ3σ4σ1σ2σ3σ4σ1σ3σ4σ1σ2σ3σ4σ1σ3σ4 σA local correction is performed in the substructure determined by an exception, which is represented by the 2σ2σ2σ2 σcommon parent of the two involved units in the hierarchy The algorithm makes successive tries to put the unit of Figure 4: Type of structural operation one of the antecedents on the vein of the unit of the anaphor An order is considered on the antecedents list, This operation aims at repositioning the constituents of based on the number of other existing references: the more the text in the appropriate place in the hierarchy It is a unit is referred to, the more probable it is to be a nucleus supposed that the ambiguity on the size of relation's or the target of a structural paste operation constituents has led to a situation in which, after several We verify the nuclear roles assignment for the nodes steps of analysis, the right constituent of a relation appears along the path antecedent-root (to see if it is a nuclear as constituent of another, arbitrarily big substructure path) and root-anaphor (to see if it satisfies the right- In the first example from subsection 3 2, this kind of satellite condition) and the constituents' association for the operation is applied in order to obtain the correct nodes on the right and left frontiers of the daughters We structure: the node extracted is the whole right sub-tree (6-perform either nuclear operation or structural operations, 8) of the structure concerned (2-8) where we found inappropriate relation link Nuclearity operations The second type of operation The global correction method consists of successive concerns simple modifications of the nuclear roles error corrections in a linear, bottom-up manner assignment The assumption underlying this transformation is that, among several possible relations, 4 Corpus Study the choice did not fall on the most appropriate of them, In our experiment we used a collection of 25 with consequences on the node's nuclearity newspaper texts from the MUC corpus (Hirschmann and Chinchor, 1997) They contained a two-level annotation: 3 3 2 The Effect on the Reference Type the original MUC co-reference annotation, and an RST We specified the changes these operations of annotation (Marcu, Amorrortu, and Romera, 1999) nuclearity and structural modification have on the vein enriched with information related to the computed veins expressions of the units affected (that are determined by expressions (Ide and Cristea, 2000) means of the nuclear path and right-satellite condition In a pre-processing phase, we manually verified the notions we used4), as well as the effect caused on the vein referential annotation We also corrected several reference These were extensively described in (Sereţan, segmentation errors in the RST annotation6 2000) We applied the correcting operations like in subsection When the changes in the vein expression are such that 3 4 in order to recover the structure, using the constraints new units are added to it, the following transformations of vein resolution of anaphora are possible for the types of vein reference: The results of the corpus study showed that most of the - an exception becomes direct reference; resolution exceptions corresponded indeed to mistakes in - an exception becomes indirect reference; the structure construction (Marcu's Compositionality Criterion was not obeyed) 3 The left (right) internal frontier of a tree consists of the set of the leftmost (rightmost) non-terminal nodes, at all depths in the tree 5 An indirect reference can also be changed so that, although 4 We call nuclear path (in a tree-like structure) a path that remaining indirect, it is resolved in a unit that is closer on the connects the source and the target along nuclear nodes only vein A node in a structure satisfies the right-satellite condition if the 6 We detected a number of 32 referential links in the annotation path that connects it to the root passes through a node that is the that were either missing or not plausibly marked Also, we right satellite of a relation detected too fine unit segmentations in 3 cases We detected 35 cases in which this happened, from the built tree to find both the target to connect a new unit to, 77 exceptions in the vein-resolution That is, 45 5% of the and the antecedents of the anaphora in the new unit Our exceptions correctly indicated errors of configuration approach is more specific with regard to the constraints They were repaired using relatively few correcting imposed to the structure, given that the vein's definition operation and usually a single modification in the elaborates the right-frontier principle (Webber, 1991) structure led to the transformation of multiple exceptions Differences appear both with respect to the target node in in direct or indirect references: 27 basic operations (10 the partial tree the new unit is connected to, with structural and 17 nuclear) were applied in the 35 cases, consequences on the resolution of new anaphora, and so that actually corrected 20 cases (an average of 1 35 on, iteratively We expect our approach to better account operation/case) and sufficed to also correct the rest of 15, for the cases in which their algorithm implausibly predicts or 42 86% from the total the target unit The remaining 42 exceptions did not correspond to Further research involves the investigations on the way structure errors, but were either pragmatical references, to in which additional reference-related factors restrict the entities known from outside the discourse, like "The text structure: the reference type, the kind of anaphor, the White House", "The Senate" etc (11 or 26 1%), or long-evoking power of the anaphor, the distance in text distance name references (9 or 21 4%) Also, 17 cases between anaphor and antecedent involve an attribution relation This big number suggests, as (Ide and Cristea, 2000) remarked, that the attribution 6 References relation's nuclearity should be reconsidered, perhaps getting rid of it entirely and allowing for the inclusion of Cristea, D , 2000 An Incremental Discourse Parser both constituents that it connects into a unique discourse Architecture In D Christodoulakis (Ed ) Proceedings unit The other 5 unresolved cases concern the use of the of the Second International Conference - Natural purpose relation (3 of them) or, interestingly, anaphors Language Processing - NLP 2000, Patras, Greece realized as definite nouns renaming the antecedents (e g Lecture Notes in Artificial Intelligence 1835, Springer "the steelmaker" referring back "Bethlehem Steel Corp ") Cristea, D , and Webber, B L , 1997 Expectations in The results show that the exceptions are good Incremental Discourse Processing In Proceedings of indicators of wrongly build areas in discourse structure the 35th Annual Meeting of the Association for They can also provide indications and suggestions on Computational Linguistics, Madrid several analysis matters Cristea, D , Ide, N , and Romary, L , 1998 Veins Theory: A Model of Global Discourse Cohesion and Coherence 85) 5 Conclusions In Proceedings of COLING-ACL'98 (pp 281− Cristea, D , Ide, N , Marcu, D , and Tablan, M V , 2000 We have shown how VT-derived referential Discourse Structure and Co-Reference: An Empirical constraints apply to the discourse structure configuration Study In Proceedings of the 18th International and can be successfully used in better structuring Conference on Computational Linguistics discourse Although the correction of an initial structure COLING'2000, Saarbrueken may not be complete, in the sense that it is still uncertain Fox, B , 1987 Discourse Structure and Anaphora Written whether the result corresponds or not to the interpretation and Conversational English Cabbridge Studies in the author intends for the text, it is clear that the method Linguistics, Cambridge University Press proposed allows us to provide a better structure which can Grosz, B J , and Sidner, C , 1986 Attention, intentions, also accommodate the structural restrictions imposed by and the structure of discourse In Computational 204 the references Linguistics, 12(3):175− We proposed the basic local correcting operations on Hirschman, L , and Chinchor, N , 1997 MUC-7 Co- the tree-like structure of text, and a global correcting reference Task Definition method that uses the successful reference annotations of Ide, N , and Cristea, D , 2000 A Hierarchical Account of texts in order to improve given structural analyses Referential Accessibility Proceedings of the 38th The method can be applied not only for the correction Annual Meeting of the Association for Computational of human annotation to structure, but also to the analyses Linguistics, ACL'2000, Hong Kong automatically derived, for instance on the basis of cue-Joshi, A , 1987 An introduction to Tree Adjoining words, in order to refine them, or during an incremental Grammar In A Manaster-Rammer (ed ) Mathematics discourse parsing process, in order to guide or assist it of Language We believe that the processes of anaphora resolution Kurohashi, S , and Nagao, M , 1997 Automatic Detection and discourse structure building are interdependent to of Discourse Structure by Checking Surface such a degree that discourse analysis should definitely Information in Sentences In Proceedings of Coling'97 1127) make use of both of them indivisibly, and combine their (pp 1123− partial results to acquire the best analysis In the same way Mann, W C , and Thompson, S A , 1988 Rhetorical that anaphora resolution can benefit from the discourse Structure Theory: Toward a Functional Theory of Text 281 structure, already solved anaphora can be used in Organization Text 8(3):243− determining the new structure, which in turn contributes to Marcu, D , 1997 The rhetorical parsing, summarization the resolution of further anaphora and generation of natural language texts Ph D Thesis The work of (Schauer and Hahn, 2001) uses the same Dept of Computer Science, University of Toronto idea of considering text cohesion to address the coherence Marcu, D , 1999 A formal and computational synthesis of problem of discourse They have proposed an algorithm Grosz and Sidner's and Mann and Thompson's theories for the combined computation of co-references and In Workshop on Levels of Representation in Discourse discourse structure, using the right frontier of the partially Edinburgh Marcu, D , 2000 The theory and practice of discourse parsing and summarization, The MIT Press, Cambridge, Massachusetts Marcu, D , Amorrortu, E , and Romera, M , 1999 Experiments in Constructing a Corpus of Discourse Trees In ACL'99 Workshop on Standards and Tools for Discourse Tagging, Maryland Moser, M , and Moore, J D , 1996 Toward a synthesis of two accounts of discourse structure In Computational Linguistics, 22(3):409−419 O'Donnell, M , 1997 RST-Tool: An RST analysis tool In Proceedings of the 6th European Workshop on Natural Language Generation, Duisburg, Germany Schauer, H , and Hahn, U , 2001 Anaphoric Cues for Coherence Relations In Proceedings of RANLP'2001 (pp 228−235) Sereţan, V , 2000 Considerations regarding the reconciliation of discourse structure with referential chains (in Romanian) Master Thesis, University of Iaşi Vonk, W , Hustinx, L , and Simons, H G , 1992 The use of referential expressions in structuring disocurse In Language and Cognitive Processes, 7(3,4):301−333 Webber, B , 1991 Structure and ostension in the interpretation of discourse deixis In Natural Language and Cognitive Processes 6(2):107–135 